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Abstract 

We study the regularized average Renyi output entropy of quan- 
tum channels. This quantity gives information about the average noisiness 
of the channel output arising from a typical, highly entangled input state 
in the limit of infinite dimensions. We find a closed expression for ^ eg , 
a quantity which we conjecture to be equal to S T ^ S . We find an explicit 
form for /3£ es for some entanglement-breaking channels, and also for the 
qubit depolarizing channel A\ as a function of the parameter A. We prove 
equality of the two quantities in some cases, in particular we conclude that 
for Aa both are non-analytic functions of the variable A. 
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1 Introduction 

The noisiness of a quantum channel is closely related to its ability to transfer 
information, and is reflected in the values of the various channel capacities. 
Much work has been done on understanding the capacities, for example [T] 
provides a recent survey. These capacities are sometimes difficult to analyze 
directly, for example the Holevo capacity is computed using multiple output 
states. Accordingly other more mathematically tractable quantities have been 
used to measure the amount of noise introduced by the channel. One example 
is the minimal output Renyi entropy [5] of a channel A, defined for r > 1: 

S rM n(A) = min — L_ bgTr (A(\cj>) (^\)) r 
10) 1 - r 

At r = 1 this yields the minimal output von Neumann entropy, which has 
a close connection to the classical capacity of the channel [3j. The entropies 
for r > 1 also provide useful properties of the channel, and in some cases are 
easier to analyze and compute. The famous additivity conjecture concerns the 
regularized version of this quantity, which is defined as 

S?X lim -S r , min (A^) 

n— >oo 71 

(the existence of the limit is an easy consequence of the sub- additivity bound 
S rM n{A ® B) < S r , min (A) + Sr,wta(B)). While the inequality S" n s lin (.4) < 
S r ,min(A) is always true, it was an open question for several years whether 
equality holds. It is now known that equality does not hold in general [11 [5], so 
this raises the interesting question of determining S T r c ^ in (A). Except for those 
channels where additivity does hold, the value of this regularized quantity is 
unknown. For channels with non-additive Holevo capacity the classical capacity 
is also defined by such a regularized quantity, so it is an important problem 
to find new ways to calculate these regularized limits. In a sense we follow a 
strategy opposite to the random channel methods used to disprove the additivity 
conjecture; our high-dimensional channels are products of fixed channels, and 
thus are constructed explicitly. 

In the hopes of finding some new insights into these regularized channel 
properties we consider a related quantity which also measures the noisiness of the 
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channel, namely the average output Renyi entropy. This measures the entropy 
of the channel output for typical input states, rather than the smallest value 
which is used to compute the minimal output entropy. For a finite-dimensional 
channel this is defined for all r > 1 by 

S r {A) = E 

where the expectation is computed using the uniform probability measure on 
the set of input pure states. We also consider the related quantity 

Pr(A)= -^logE[Tr (A(\4>}{<j>\)) r ] 
1 — r 

Note that by Jensen's inequality 

S r (A) > /3 r (A) ■ (1) 

These quantities can be computed (at least numerically) for any given channel. 
In operational terms, they describe the long-run average output Renyi entropy 
of the channel for a sequence of random pure input states. Loosely speaking, 
they measure the average noisiness of an output state from the channel. 

As with the minimal Renyi entropy, we also consider the regularized versions 
of these quantities. However unlike the minimal Renyi entropy, the existence of 
these regularized limits is not obvious, so we define them conservatively using 
the liminf: 

S 1 ^ (A) = lim inf - S r (A® n ) (2) 

n— >oo n 

pl es (A) = liminf - MA 9 ") 

n— >oo Ji 

We conjecture that the two quantities in ^ are equal, however we do not yet 

have a proof of this for a general channel. For the specific channels we look at 
in more detail the quantity /3£ cg (A) is given by the expression above with lim inf 
replaced by lim. But from the closed expression for j3 r r efs {A) presented below 
there is a possibility for more complex limiting behavior. 

For one special class of channels we can compute a simple formula for /3£ cg (.4) 
for integer values of r. These are a subset of the entanglement breaking (E-B) 
channels [SI E], which can be written in the form A(p) = J2k CT fcTr (X^p) - 
for some states <7fc and with Xk a POVM - meeting the additional condition 
a ki) > 0. For r = 2 this includes all the E-B channels. For higher r a 
particular class of E-B channels that fulfill the condition are the QC channels 
as defined by Holevo [5], where cr^ = |fc)(fc| are pure states formed by an or- 
thonormal basis. For unital E-B channels which satisfy the condition on the a^. 
we can prove even more, namely that S r ° s (A)=P^, cs (A) — logd. 

We consider only finite-dimensional channels with equal input and output 
dimensions, and we define the dimension of the channel to be this common 
value. The identity matrix is denoted by 1. 



log Tr CA(|^D) 
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Theorem 1 (proof in 

(a) Let r > 2 be an integer, and A{p) = ^fcTr (Xkp) an entanglement 
breaking channel satisfying the condition Tr (H[=i °"A<) — /or aZZ choices 
of {hi}. Then 

pl es {A)= lim i,5 r U® n ) = logTrU(l/d) r ). 

n— >oo n 1 — 7" 

(b) Let A be a d-dimensional unital entanglement-breaking channel. A(t) = 1, 
satisfying the condition Tr (]X=i °"fei) f or a ^ integer m > 2. Then for all 
real r > 1 

5; eg (^)=^ e «(^)=iogd. 

We also derive an explicit expression for j3], cg (A) in the general case. The 
statement of this result requires some additional notation. First recall the defi- 
nition of the Choi- Jamiolkowski representation [S] of a channel, namely 

choi{A) = Y,A(\*){y\)®\x){y\ 

x,y 

where \x) and \y) are orthonormal bases of pure input states. Also let Sym(r) 
denote the symmetric group on r letters. Then every element a € Sym(r) 
defines a permutation operator on (C^)® 1 " by lZ(a)(vi ® • • • ® V r ) = v a ci) ® • • • ® 

7J a ( r ). 

Definition 2. Let .4 be a channel. For all a € Sym(r) define 

Q A . r (a) =Tr [Choi{A)® r {K{123...r)®'R.(a))] . (3) 

Furthermore, for some channels /3™ s is a simple limit. One such class are 
the entrywise positive maps defined studied in |10) . 

Definition 3. A channel A is called entrywise positive if there exist a bases for 
input and output space such that (s|„4(|x) (y|)|£) > for all x,y,s,t. 

It is clear that this definition is equivalent to Choi{A) being entrywise pos- 
itive. 



Theorem 4 (proof in [ 
(a) Let r > 2 be an integer and let Q max = m3X- a &Sym{r) \Q>A,r{°t)\- Then 

r log d - log Q max 



Pr C HA) = 



1 



(b ) Let r > 2 be an integer. If the maximum Q m ax is attained for a unique a 
then the liminf in /3£ eg can be replaced with a regular limit: 

(3 r r cs (A) = lim -(3 r (A® n ). 

n— >oo fi 



4 



(c) If the channel A is entrywise positive then for all integer r > 2 

F r cs {A) = lim -p r {A® n ). 

n— foo fl 

The evaluation of Q max seems to be a difficult problem in general for large 
values of r. However for two special permutations the quantity Q.A,r{ a ) can De 
evaluated easily, namely the identity permutation and the full cycle: 



Q A , r {id) = Tr^(l) r , Q Ar (123...r) = Tr (Choi{A) r ) . 
Thus for r = 2 the result can be stated more explicitly as follows. 
Corollary 5. Let A be a d- dimensional channel, then 

P T 2 es (A) = 21ogd-logmax[Try4(l) 2 ,Tr(C7ioi(.4) 2 )] 

To make further progress we now focus on one of the simplest cases, namely 
the qubit depolarizing channel A A , where we are able to prove a number of 
additional results. In particular using concentration of measure arguments |12[ 
[T5] we compute the regularized quantity (A\) - which we call regularized 
output entropy in the remainder of the paper - for integer values of r, and for 
a range of values of A. One interesting consequence is that this quantity is a 
non-analytic function of the depolarizing parameter A. Recall the definition of 
this channel: 

A A (p) = + 

The channel is completely positive for — 1/3 < A < 1 and is entanglement 
breaking for —1/3 < A < 1/3. 

Theorem 6 (proof in |2~6T2]) ■ 

(a) For all r G N, r > 2, and A € [0, 1], 

^(A A )= lim I / 3 r (Ar)-nin(l, 2r - 1 ° g ^ 1+3A ) r + 3 ( 1 - A )' 



r- i v — ^ / ' i 

ra->oo n y r — 1 

in particular 



$ eg (A A ) 
/3S s (Aa) 

where /3™ S (A A ) = lin Woo /3^(A A ) 



1 A<1/a/3 
2-log(l + 3A 2 ) A>l/V3 



1 A < 1/3 

2 - log(l + 3A) A > 1/3 ' 
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(b) For all r G N, r > 2, and A £ J r , 

5:° S (Aa) = /3; cs (A a ), 

where J r = [0, c r ] U [d r , 1] C [0, 1] for some < c r < d r < 1 (see TaWe[7] 
m WJl\) . 

From the explicit form given in (a) it is clear that f3 r eg (A\) does not have 
a continuous first derivative for some A r € [1/3, 1/v3]j in particular it is non- 
analytic. Because S r S (Aa) is defined as the liminf of a series upper bounded 
by 1 it is well defined for all < A < 1. Furthermore we know from (b) that 
it is equal to 1 for some range A € [0, c r ] but at A = 1 its value is 0. Therefore 
5™ S (Aa) has least one non-analytic point somewhere in the range A € [c r , d r \. 

It is tempting to associate this non-analyticity with a transition between 
distinct phases of the model, but the operational meaning of this is unclear at 
the moment. We conjecture that the quantities S r ° s (A\) and /3 r eg (A\) are in 
fact equal for all A, and for all r > 1. It is noteworthy that the channel A a is 
entanglement-breaking at and below the value A = 1/3. 

Counterexamples to the additivity conjecture have been found so far by using 
randomization techniques [T4j |X5l US [5] . This has led to an understanding of 
the behavior of a typical high-dimensional channel, at least insofar as it affects 
the minimal output Renyi entropy, by proving the generic existence of channels 
all of whose output states have high entropy. Here we look from a different point 
of view, by considering the properties of a typical output state for a product of 
many copies of a channel. Open questions remain, for example the amount of 
entanglement in a typical output state. We note that the questions addressed 
here have a different flavor from arguments based on locality, since here the 
system is fully entangled across all copies. 

2 Main result 
2.1 Notation 

We work with a general channel A and its tensor product C — A® n . The 
dimension of A is 

d = dim A 

To achieve the results for S 1 ™" and /3£ cg in Theorems Q] |4] and [6] we first find a 
closed expression for the average moments for integer r > 2 

M r (C) = E [Tr (C(|0Wir)] , (4) 

where averaging is over random pure input states \<p) = U\0) with U distributed 
according to the Haar measure on SU(d n ). We then use the relation 

/?; cs (A) = lim inf . 1 . log M r (A® n ) 
n->oo nil — r) 
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2.2 Evaluating trace moments 

We rewrite the trace moment (U) by inserting four complete sums that run over 
the entire input space 

E[Tr(C(|0)(0|D] 



E 



E 



Tr( E \a)(a\c(\x)(x\0)m(y\)\b)(b\ 



a,b,x,y 



{ai,Xi,yi} i=l 
i—l...r 



{a<,Xj,i/i} i=l 
i—l...r 



(5) 



with the identification a r +i = ai, and with 

Caxyb = (a\C(\x)(y\)\b) 
the matrix elements of the channel. The expectation value in ([5]) 



E 



YiixM (<t>\yj) 



E 



Ylixiiumoiiriyj) 

i=i 



(6) 



of products of matrix elements of unitaries distributed according to the Haar 
measure may be calculated using Weingarten calculus [17] • The Weingarten 
function Wg : N x Sym{r) —> M maps pairs of dimension k and elements of the 
symmetric group Sym(r) into the reals. The general expression is 



E[U iljl ...U irjr U iiji ...U i/rfr \ 

a.,l3^Sym(r) 

Thus © simplifies to 



n<^io><oi£/*i%-) 



E ^ (1 ,y 1 ---^ M ^Wg(fc,/3a- 1 ) (7) 

a,/3£E Sym(r) 
a£5i/m(r) 



(8) 



where A: is the input dimension for C, and in the last step the Weingarten 
function is summed over all permutations 7 = This sum can be evaluated 

explicitly, as was shown for example in [T5] : 



Ck , r = e w s(^)=n^ 



7£Sym(r) 



(9) 



7 



We plug the evaluated expectation value © in our original expression §5§ and 
get 

/ r r 

{cii,Xi,yi} 2—1 a£Sym(r) j — 1 

r 

=c k , r e n<w ( w ( io ) 



{ai,a:i} a£Sym{r) i—1 
2=1. ..r 



Now define 



Qc,r(a)= e ii Ca « x « x «.(i)«i+> = e Tr n c ^ x «w ' ^ 

{cii,Xi}i=l {xi} i=l 

i—l...r i—l...r 

where the matrices C xy have entries (C xy ) a b — C axy b or more simply C xy — 
C(\x)(y\). If it is clear from context we may omit one or both subscripts Q(a) = 
Qc{oi) = Qc,r(o). These are the terms we will analyze to a great length in the 
rest of the work. 

The Q defined in this way is identical to the one from Definition [5J This can 
be seen from the following calculation 



Tr (C7ioi(C)® r (ft(123 . . .r) <g> K(a))) 
= Y, ( a i, x i\ ® " " " ® (a r , x r \Choi(C)® r \a 2 , x a ^} ® • ■ • ® \ai,x a ^} 

{a,x} 

= E ( a il C (l a; i)( x «(i)l)l a 2)( a 2|C(|x 2 )(a; Q( 2 ) |)|a3) •• • 

{a,x} 

r 

= E n^a i x i a; Q(i) ai + i 

{a,i,Xi} i=l 
i—l...r 

=Qc,r(a). 

In terms of the Qc we get our final working expression for the average moments 
M r (C)=C k , r J2 

a£ Sym(r) 

In general Qc{a) could have complex values. However in some cases it can be 
shown to be real. In particular for the depolarizing channel it follows directly 
from C axy b > that Qc(a) is positive. 
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2.3 Product channels 

In the case where C is a tensor product C = T> ® £ we work in the product base 
| a;) = \x'x"). Now the tensor and channel application are interchangeable 

V®£ xy = V®£{\x'x"){y'y"\) 

= V{\x'){y'\)®£{\x"){y"\) 

And therefore, the Qx>®s{o) factors 



i—l...r 



Tr II%„, 8£ « ( „ 



2— l...r 



E ''il^ '-. E ^II^w 

z— l...r 2=1. ..r 

Gj>(a)efi(a). (12) 



2.4 Average moments of A® n 

If we set C = A® n the average moment factors as above, the dimension is k = d n 
and according to fT2"]) we get 

M r (A® n ) = C dn , r QaA^T- (13) 

a£Sym(r) 

When the meaning is clear from the context we suppress the index in Qa ■ The 
limiting behavior of this sum is relatively simple and determines the quantity 
f3^ es (A) as described in the following theorem. 

Theorem [4j (a) Let r > 2 be an integer and let Q ma x = max a6 Sym(r) I Qa. r i a ) 
Then 

P*Z(A\ = r log d- log g max 
r — 1 

(b ) Let r > 2 be an integer. If the maximum Q roax is attained for a unique a 
then the liminf in /3£ cg can be replaced with a regular limit: 

P r r cs {A) = lim -0r(A 9n ). 

n— >-oo fi 

(c) If the channel A is entrywise positive then for all integer r > 2 

(3 r r cs {A) = lim -P r (A® n ). 

n— too fi 
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Proof, (a) From the definition of © and (fT5]l we have 

1 



F r es (A) = liminf 



= liminf 



oo n(l — r) 
1 



log M r (A® n ) 



n->oo n(l — r) 



log 



C dnjr ^Q(a) f 



Note that 



a a 

<Q„, r ^|Q(a)r 

a 
a 

= r\d~ nr Q n 



Since M r 



< 1 we have 



> 



1 


r — 


1 


1 




r — 


1 


1 





lim inf — log 

n— too TL 



liminf- log (Hd-^Q^) 1 

n— yoo n 



r _ i - v - — - , 

__ r log d - log Q max 
_ r-1 

In order to prove equality, we will use the existence of a subsequence {fij} 
such that 



1 



lim 

j^oo rij(l - r) 



log 



< 



r log d - log Q n 
r-1 



(14) 



To this end, let {ai, . . . , a^} be the maximizers satisfying |Q(ctfj)| = Q max , 
so that Q{a.i) — Q m axe 27 ™ 7i for some ji € [0,1). It is a basic result from 
simultaneous Diophantine approximations (using the Dirichlet box principle) 
that for any e > there is an increasing sequence of positive integers rij such 
that maxi{rij7i} < e for all j, where {x} denotes the distance to the closest 
integer. Choose e = 1/6, then we have 



A' 



N 



i=l 



— -^m; 



1=1 
iV 



(15) 
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Furthermore, there is < 1 such that 

\Q(a')\ < g max -e 

for all a' which are not maximizers. Thus 

a a 

AT 

_t Cd™3 , r 2max ' " , r 



2 

N 



r! 9™' 



Since G < 1, for j sufficiently large we have y — r\Q nj > |, thus for j sufficiently 
large 



1 



n 3 (l - r) 



< 



1 1 



T -log(3S m -C- 



Using linin^oo {d nr Cd^. r ) = 1, the result follows immediately. 

(b) Let ao G Sym(r) be the unique permutation satisfying |Q(ao)| = Qmax, 
so that JV = 1 in the notation of (a). Then (|15p is replaced by the equality 

|Q(«o)"l = QILx 

which holds for every n. Thus the inequality (|14j) is true for every n, hence the 
upper and lower bound yield the existence of the limit. 

(c) If A is entrywise positive then every term Q(a) is also positive. Thus 
the inequality (|T5j) is replaced by 



N 



and this holds for every n. Thus again (|14j) is true for every n, and the result 
follows. □ 



2.5 Entanglement breaking channels 

After dealing with basic facts about /3™ s we turn our attention to the special 
case of entanglement breaking channels. In this case calculating the relevant 
Q-terms and studying their properties is particularly easy. We restate and then 
prove Theorem [1] from the introduction. 
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Theorem Q3 (a) Let r > 2 be an integer, and A(p) = ^<7fcTr(JTfcp) an 
entanglement breaking channel satisfying the condition Tr (111=1 a ki) > 
for all choices of {ki}. Then 

(3^(A) = lim -P r {A® n ) = -^logTr(.4(l/d) r ) . 

n— >oo n 1 — r 

(b) Let A be a d- dimensional unital entanglement-breaking channel, A(t) = 1. 
satisfying the condition Tr (Jl"=i a kt) for all integer m > 2. Then for all 
real r > 1 

5r g (^)=^ es (-A)=io g d. 

Proof, (a) The result follows immediately from Lemma [7] and Theorem |4] 

(b) For a unital entanglement breaking channel fi T r c ^{A) attains its maximal 

value log d which is at the same time the maximal possible value of S r S (A) . 

So, with inequality JIJ it follows that S" CS (-4) = (3^ (A) = logd for all integer 

r > 2. In the following we extend this result to all real r > 1. 

Consider the derivatives of the function f n (r) = A InE [Tr (A® n (\<j>)((l)\) r )] / In 2 

with respect to r, and set p = A® n {\(j>) ((j>\) 

/' (r) = — E [Tr p r ] 1 E [Tr (p r In p)] / In 2 < 

n 

/4'(r) = -E [Tr p r ] _1 E [Tr (p r (lnp) 2 )] / In 2 — — E [Tr p r ] ~ 2 E [Tr (p r In p)] 2 / In 2 > 

(16) 

To prove the second inequality we use two applications of the Cauchy-Schwarz 
inequality First |Tr (AB) \ < Tr (A 2 ) 1/2 Tr (B 2 ) 1/2 with A = p r / 2 and B = 
p' r / 2 Inp, and then E [XY] 2 < E [X 2 ] E [T 2 ] to deduce 



E[Tr(p r lnp)] 2 < E [(Tr (//'(hip) 2 )) 1 / 2 (Trp^ 1 / 2 



< 



[Tr(p''(lnp) 2 )] E [Tr p r ] 



Therefore, the function f n (r) is convex in r for r > 0. We also know that 
fn{r) > (1 — r) logd for any real r > 1 and lim^oo / ra (r) = (1 — r) logd for 
any integer r > 1. Therefore, for any integer ro > 1 and re [ro, r$ + 1] we have 
upper and lower bounds 



/n(ro)(ro + l-r) + / n (r + l)(r-r )>/„(r)>(l-r) logd. 



(17) 



Thus we have lim^oo /„(r) = (1 — r) logd for all real r > 1. Dividing by 1 — r 
gives limyj^oo Ag,.^®") = logd the desired equality for all r > 1. 

Finally, for r = 1 the Renyi entropy is defined as the Neumann entropy, 
which equals the limit —Si(A® n ) — lim r _n 4~ = — /«(!)■ Again, using the 



i —ri \ — T 

bound H?]) with r = 1 (and noting that /„(1) = 0) we get -/„(2) (1 - r) > 
/n(»") _: (1 — r ) logd and so 



-/„(2)<^<logd 
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which implies -/»(2) < ^(A® 11 ) < \ogd. But as lim^oo -/„(2) = logd 
both sides of the bound become equal and we have lim^oo ^Si(A® n ) = logd 
as well. □ 

Lemma 7. Let A be an entanglement breaking channel with 

^(j[°kA>0 (18) 

for all choices of {hi}, then a — id is the unique maximum of Qa r so Q max = 

Proof. Entanglement breaking channels are of the form A(p) = Y] cr^-Tr (Xkp) 
where the Ok are density matrices and the Xk constitute a POVM. Therefore 
we calculate 



|2(«) 



E T rIlA 

i— 1. . .r 



E Tl \'[[^2 cr kTr(X k \x l )(x a{l) \) 



{*<} 

i—l...r 



Ki=l k 



e ^ n ir(x fei 1^)^)0 



{xi,ki} \i=l 
i—l...r 



; = 1 



^ e Tr e n^awi^i^ 



{/=,} Vi=l 

i—l...r 



{**} i=1 

2=1.. .r 



(19) 



where condition (|18|) is used in the last step. The term on the right side of the 
last line can be rewritten as a product of traces 



E II< a: «(i)|-X'fci|a?i> = E ■■■{x a 2 {l) \X kci{1) \x a{l) ){x a{l) \Xk l \x 1 ) 



{xt} i=l 
i— l...r 



i— l...r 



= n Ti n x * 

where 7 € a are the sub-cycles of a and Hie-,-- 1 * s a P r °duct over the numbers 



m 7 



Now consider any set of operators Yj > 0, using the spectral decomposition 
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y j = Hk ^k,j\k)j(k\j we have 



TrJJl^-= A fel ,i...A fcm , m Tr(|/c 1 ) 1 (A: 1 | 1 ...| 

k"in ) m (fcm \ m ) 



.7=1. ..m 



(20) 



< 



j=l...m 



i=i 



Equality holds only in the following cases: 

• If m = 1. 

• If any of the Y}'s equals zero. 

• If all the Yj are a multiple of a one-dimensional projection. 

To see that there are no other possibilities consider the case where m > 2 and 
all Yj are rank one but they don't have the same eigenvectors. Now the sum 
in (|2U|) contains the overlap of the eigenvectors which is smaller than one in 
absolute value because some eigenvectors are not the same. Therefore there is 
no equality. Finally, consider the case where m > 2, where all the Yj have at 
least rank one and where there exists a jo such that Yj has rank two or higher. 
On the RHS of (|2U1) . choose the kj such that all A&. are non-zero. For kj there 
are two or more choices and for one of those choices the trace term has to be 
smaller than one in absolute value. Therefore equality cannot hold in this case. 

Returning to the Xj we see that if any Xj = we can drop it from our POVM 
without changing the channel A. Also if say Xi — qX2 are multiples of each 
other then we can combine them to X\ = X\ + X^ and <j\ = (o"i + goa) /(I + ?) 
again without changing A. Therefore we can assume no Xj equals zero and no 
two Xj are multiples of each other, then equality is only possible if m = 1. It 
follows that 



J|Tr Y[ X ki <[] Tr ^fe 

7Sa ig7~ 1 i=l 



can only be equality if all cycles in a have length one, i.e. a = id. 
Combining the last inequality with (jl9[) we get 



\Q( a )\ < Tr ri Tr ^ = Q ( id ) 



j=l...T 
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with equality if and only if a = id. And finally, 

c(id)= Tr (n^ )n^^ 
{k,} \*=i / <=i 

i—l...r 

= ]T Trlf[a ki TrX ki j 

{fei} \»=i / 

i— l...r 

= Tr m^c7 fc 'B:X fc J 

\i=l fe / 

□ 



2.6 The qubit depolarizing channel 
2.6.1 Evaluating Qa a (q) 

In this section we calculate Q,A x ( a ) f° r ct — id and a = (l...r). As we 
prove in Lemma [13] in 13.11 one of these two terms is always maximal, Q max = 
max{QA A (id), Qa a ((1 ■ ■ • r ))}, so they are of particular interest. 
We have 

Q AA (id) = TrA A (l) r =Trl = 2 

To evaluate the second Q-term we consider a slightly more general channel 
A with Choi-Jamiolkowski representation 

At A\ 

V K 

Ch 0l (A x )= Q k y Q . 

\A /i/ 
This matrix has diagonal block form with blocks 



A 

A [L 



v K 

K V 



We need to raise the matrix to the r-th power, which gives 



[i A 
A /i 



1 A 1 



2 VI 



(A* + A) r 

o (fj, - xy 



and similarly for the second matrix. Therefore we get 

Q A ((l...r))=Tr(Choi r Ax ) = (/i + A)' r + (/x - A) r + (v + n) r + (v - n) r . (21) 
For A — A\ we have n = 2*-, v = and n = 0, so 



QA A ((l..-r)) 



1 + 3A 



1 - A 



15 




Figure 1: S2 and S 



r 


c r 


d r 


2 


.577 


.732 


3 


.5 


.835 


4 


.458 


.878 


10 


.381 


.953 


100 


.338 


.995 



Table 1 : Range parameters of Theorem [6] 

In 14.21 we compute the Q((l . . . r)) for any dimension d > 2. 

More generally, whenever a is a product of cycles of consecutive numbers 
the sum factors as shown in LemmaEE e.g. Q((123)(45)) = ±Q((123))Q((45)). 



2.6.2 Regularized output entropy of Af n 

We are now ready to prove the main theorem. 
Theorem [H (a) For all r e N, r > 2, and X E [0, 1], 



^ es (A A )= lim -/3 r (Af l ) = 



n— >oo ft 



2r - log [(1 + 3A) r + 3(1 - X) r 
r-l 



(22) 
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in particular 

#T s (Aa) = 
^ s (Aa) = 



2-log(l + 3A 2 ) A>l/V3 

'l A<l/3 
2-log(l + 3A) A > 1/3 ' 



where /3^s(A A ) = lim™ ^{A x ). 

(b) For all r G N, r > 2, and A € J r . 

^ Cg (A A )=/3 I rc s(A A ) (23) 

where J r = [0, c r ] U [c? r , 1] C [0, 1] for some < c r < d r < 1 (see Ta&/e[7p. 

The regularized output entropy for r = 2 and the lower bound of the same 
for r = oo are plotted in Figure [TJ We expect that (|23p holds for all A. The 
regularized output entropy is maximal when 2 > +3 (^j^) which holds 

for all r when A < 1/3. It's interesting to note that A < 1/3 is also the condition 
for A A to be entanglement breaking. 

Proof, (a) From [O] we know that Q(id) = 2 and Q((l...r)) = ( iJ ^ A ) r + 
3 ("3^-) r . In Lemma [TBI in IXTI we prove that one of these two Q-terms yields 
the maximal value, i.e. for any A S [0, 1] 

, ,'l + 3A\ r „/l-A 
max < 2, 



Additionally, A\ is entrywise positive for A G [0,1]. Therefore we can apply 
parts (a) and (c) of Theorem |4] and (f22|) follows immediately. 
In the case r = 2 we have 

2-2-log[(l + 3^ + 3(l z A) 1 = 4 _ log(4 + m2) 



2 - 1 

and in the case r = oo we have 

lim ^ cg (A A ) = lira 

r— >oo r— >oc 

= lim 



= 2-log(l + 3A 2 ), 
2r — log [(1 + 3A) r + 3(1 — A) r ] 

oo t — 1 

2r-r log(l + 3A) 



r— »oo T — 1 

2 - log(l + 3A) . 



(b) Define the functions 



/(|^))=Tr(Ar(l0)^ir) 
9{X) = ^hr) X ° gX > 
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and the two series (notice the n dependency in the definition of /) 



a n = g(Ef) , 
K = % ° / • 

The only difference between a n and b n is the position at which the averaging over 
pure inputs \<f>) takes places. As a result, a n = -/3 r (A® n ) contains the average 
moments that have been considered in previous sections, and b„ — ^S r (Af n ) is 
the Renyi output entropy per systems. By Jensen's inequality and because the 
maximal output entropy is log 2 = 1 we have the bounds a n < b n < 1 for any A. 
Our goal is to prove that the two series have the same limit for A £ [0, c r ]U [d r , 1]. 
From part (a) we know 

r - log Q max 
hm a n = . 



n— >oo T — 1 

First, choose c(r) such that 2 > ( j 4p) r + 3 (^Y for all A G [0, c}. Then we 
have Qmax = 2, and so 

lim a n = lim b n — (A\) = 1 . 

n— >oo n— too 

Now only (|23|) remains to be proved for the range A 6 [d, 1] with d still to be 
determined. 

In Proposition [18] in 13.21 we prove that the Lipschitz constant r\ of the map 
/ is bounded by rj < \[2rn n for k = A + From Levy's Lemma (according 
to Lemma III. 1 in [13]) we know that the values of / concentrate around their 
average 

Pr(|/(|0» - E/| > a n ) < 4exp (-C{k + =: e„ (24) 

with k = 2 • 2" — 1 the (real) dimension of the sphere of input states, C = 
(97r 3 ln2) _1 , and we choose the deviation 

1 ( QmaxV 1 1 

2 \ 2 r J 2 

where N > 1 is the multiplicity of the maximum of Q(a), that is, a„ is half of 
the dominant term of E/ (see prove of Theorem [5]). Now a n — > (apart from 
the special case A = 1) and E/ — a n > which is required in a later step. To 

ensure concentration for large n the exponent (fc + l)-r*r needs to become large. 
Taking the 2n-th root we get 



>; 2 



l/2n / „ iV /- Q max \ 2 "\ 1/2 " 

2™-(^^J \ 



V2. 



if- ) \ 2r 2 K 2 

Qmax 
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Figure 2: Plots of both sides of (|25|) for some r values to determine the range 
of validity. 



where the approximation becomes equality in the large n limit. If this term is 
larger than 1 we have the required divergence, this gives the following inequality 

<■> 

This condition gives the lower bound d r for the A values for which (|2U)) holds. 
Both sides of the inequality are plotted in Figure For some values of r the 
range parameters can be found in Table [TJ 

To transform (|24|) into a statement about the range of g - and whence 
about a n - we need a condition that implies \f(\<f))) — Ef\ > a n . We set 
< = »( r -i) Ef a - a „ > - g(Ef - a n )\ > \g(Ef) - g(Ef + a n )\ where the 

second inequality follows from convexity of g. Then for any x we have 

\g(Ef)-g(x)\ >a' n ^\Ef-x\>a n . 

Therefore, the values of g o f concentrate around a n = g(Ef) 

Pr(| 5 o/(|0))-a n | >a' n )<e n . 

For a' n we calculate 

,. / ,• 1 a n 

nm a n = am — -, tttt 

n-i-oo n-s-oo n(r — 1) Ej — a n 

= lim — — —— 

n->oo n(r — 1) 1/2 

= 0. 
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To find an upper bound on b n assume all \<f>) G (go f)~ 1 [a n — a' nl a n +a' n ] = B 
map to the maximal value a n + a' n and all | <f>) 6 B c map to the maximal value 
1. This gives a possible range for the average 

K G [fltn, (1 - e„)(a„ + ct n ) + e„ ■ 1] . (26) 

For A G [d, 1] both e n and a' n tend to for large n and with ([26|i and because 
we know the limit of a n we have 

,. ,. , r-logQ max 
lim a n = lim o„ = . 

n— >oo n— >oo r — 1 

□ 

2.6.3 Output entropy of random sequences 

Considering sequences of random pure input states \<f> n ) with increasing dimen- 
sion 2™ we have the following statement. 

Proposition 8. Let \<j) n ) G C 2 be a sequence of random pure states and 

1 

n 

the sequence of output Renyi entropies per system. Then 

r - log Q max 



c n = -S r (Af n (|^ n )<^|)) 



a.s. 



r - 1 

if A is restricted as in Theorem [31 

Proo/. Define /„(|<£» = lS r (A® (0|)), the limit value c = ^ = 
limn^oo E/„, and £ n = |E/ n — c|. Then for any e > 

Pr(|/„(|</>» ~c\>e)< Pr(|/„(|0)) - E/„| > e - S n ) (27) 



<4exp -C(fc + 1)- , 
V V 

with C, fc = 2 • 2™ — 1, and n < \f2rn n with < k < 1 as in the proof of 
Theorem [6] If A £ J r then lim(5 Tl = 0. When n becomes large then k becomes 
large, 77 becomes small, and e — 6 n is close to e > 0. Therefore the probabilities 
in (f27|) become small. Let 2V be such that for n > N we have 5 n < e/2. Then 
we have the bound 



J2 Pr(|/ n (K)) - c| > e) < £] 4exp ( -C(fc + 1 



71=1 



<iV+ ^ 4exp^C(fc + l) (e/2) 
< iV+ J] 4exp(-C'(2//i 2 ) n 



'/ 2 



n>AT 

< OO , 
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Aqo An 

Aqi )\ -4.10 /\ 
Figure 3: The diagrams for the A xy matrices. 
-Aqo^iAloAliAlo 



Figure 4: A product of A xy matrices and the corresponding diagram. 

(e/2) 2 

with C = C ■ 2 • ^j^r-y independent of n. The Proposition follows by the lemma 
of Borel-Cantelli. □ 

3 Proof of lemmas 

3.1 Maximal Q for A A 

Before proving Proposition [T3] which is required for the proof of Theorem [5] we 
introduce some new notation and we present four lemmas. We only prove our 
Lemmas in two dimensions, but similar results will hold for higher dimensions. 

For the most part we consider a channel A slightly more general than the 
depolarization channel with 

Am ={o °) ■*»=(!! °) 

Al0 =(°\ o) Aoi= (? k o) 

where k, A, /i, v 6 Kg and /i > A > k. We call this channel the two-rail 
channel. Remember that in the case A — A\ we have fi = ^5^, f = and 
k = 0. Because all these matrix entries are positive, so are the Q-terms, and 
therefore the largest positive Q-term will yield Q max - 

We think of these matrices as the diagrams in Figure El We refer to the lines 
as rails and to their vertical position as their track (starting with track 0) 

A product looks like the diagram in Figure 0] notice that we read from right 
to left, the same way that matrix multiplication applies. Consider a vector 
multiplying with this product, the diagram can be thought of as presenting two 
rails along which the two entries of the vector pass through to the left. On the 
way, continuous lines multiply the entries with factors fi or A, dashed lines with a 
factor v or n. Because the products are inside a trace, they will only contribute, 
if the rail starting at the top on the right, ends on the top at the left (giving the 
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(0| . . . |0) contribution), and the same for the rail starting at the bottom (giving 
the (1| . . . |1) contribution). Using this, we can compare contributions to Q(a) 
for different a. 
We rewrite 

C(a)= ]T Tr(6) 

SeV(a) 

where T> (a) is the set of 2 r diagrams corresponding to a and we identify the 
diagram 5 with the corresponding matrix. 

Furthermore, for a diagram 6 we define S 1 to be the unchanged diagram 
and <5~ x to be the horizontally reflected diagram. If we take 2?o(a) to be the 
set of all diagrams in T>(a) that start high (at track 0) then obviously T>(a) = 
T>o(a) U T>Q 1 (a) provides a convenient splitting of the sum in Q(a). 

We say an a is non- overlapping if all its cycles permute consecutive numbers, 
e.g. a — (123) (45) (678). For such a we write a = a\ . . . a 8 , where all the a; are 
the cycles. We define a product for diagrams by simply concatenating them. 
With this product we get T>(a) = T>(a±) . . . T>(a s ). 

Lemma 9. Consider a diagram 6 £ 2?((1 . . . r)) of a two-rail channel. Then we 
have 

S + 5- 1 oc 1. 

Proof. In any diagram S € 2^((1 ■ • ■ r)) one rail starts and ends at track and 
the other starts and ends at track 1. That means the corresponding matrix is 
diagonal. Reflecting the diagram simply means exchanging the two diagonal 
entries, and if we sum 5 + S^ 1 then the diagonal entries both have the same 
sum, i.e. it is proportional to unity. □ 

Lemma 10. Let a — ai . . . a s £ Sym(r) be a non-overlapping permutation 
consisting of s cycles. Then the Q(a) of a two-rail channel factors like 

2( a ) = ^r2(«i)---2(^)- 

Proof. First we use the fact, that the diagrams for a non-overlapping a can be 
split between cycles, i.e. the sum splits into sums over separate diagrams, 

Q(a)= Tr W 

SeV(a) 

<5!G-D(qi) 6 s eD(a s ) 

= Tr( £ Sl -.. £ 6. 

\<SieX>(ai) S B eV(a B ) 

Using the splitting T>{a) — T>q(q) U I? ( J" 1 (a) and Lemma [9] we have 

E 5 = E s+s- 1 ^! 

sev( ai ) sev (a z ) 
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for all s cycles cti . Therefore, we can split the single trace into s separate traces 

<5i<EX>(ai) SxeT>(a c ) 

= 2^tQ(«i)...SM. 

□ 

Lemma 11. For A = A.\ the depolarizing channel in two dimensions, Q re- 
stricted to non- overlapping permutations a e Sym(r) is either maximal when 
a = (1 . . . r) or when a = id. 

Proof. Remember from l2.6lTl that Q((l •••?")) = (M + A) r + 3f r . It is convenient 
to introduce 

f(x) = ( M + \) x + 3v x , 

as a function with range R. For a pure a = cti . . . a s according to Lemma [TU] 
we get 

s(«)=in/(KD- ( 28 ) 

i=l 

with |o,| denoting the length of a cycle. 

In the following we keep the number s of cycles and the total length r of 
the permutation invariant. Now, if we increase the length of one cycle and 
decrease the length of another, we are only changing two factors in the product, 
f(x)f(k — x), where x is the length of the first cycle and k the (invariant) sum 
of the lengths of both cycles. We rewrite 

f(x)f(k -x) = ((/x + X) x + 3^)((/i + X) k - X + 3v k ~ x ) 

= ( M + X) k + %v k + iv k + 3(M + A) fc (j^Tx) X ■ 

Because of (i + A, v > the function f(x)f(k — x) is convex in x. Therefore it 
is maximal at the boundaries, i.e. when one cycle has the minimal length of 1. 
If we repeat this procedure s — 1 times we end up with one large cycle of length 
t = r — s + 1 while all other cycles are of length 1. In every step we increase Q, 
so we get the bound 

Q{a) < Q((l)(2) . . . (s - l)(s . . .r)) = ^/(lrVW = f(t) (29) 

for non-overlapping permutations a consisting of s cycles. 

Now compare the upper bounds given by ()29[) for permutations of the same 
total length r but different number of cycles s. This is the same as varying 
t . Because / is convex we get a maximal upper bound if t is minimal or 
maximal. The minimal value t = 1 is achieved when s = r and all the cycles 
are of length 1. Then (|2T)|) becomes an equality - there are no steps necessary 
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in the maximization procedure - and we have Q(id) = /(l) = 2. The maximal 
value t — r is achieved when a is simply one large cycle. Again ([29|i becomes 
equality, and Q{a) = f(r). One of these upper bounds is the highest upper 
bound possible in (|2"5|) . and because they are achieved by Q(id) and Q((l . . . r)) 
we know that one of these Q(a) is maximal over a € Sym(r). □ 

Lemma 12. Let A be a two-rail channel, and [/3] i/ie conjugacy class of a 
permutation. Then Q restricted to [j3] is maximal on non- overlapping members 

ae[p\. 

Proof. Let a — ct\ . . . a s be a non-overlapping member of the class and /3 = 
7«7 _1 be any other member of the class. First, remember 



Q(«)= ^ Tr n^J- E ••• E T±(5x...8 a ). (30) 

{xi} \i=l I 8 x £V{oti) S s €V(a s ) 

i—l...r 

With the first way of writing Q(a) in mind we define a 1-1-mapping between 
terms in the sum of Q(a) and Q{f3) via a mapping of indices Xi — » x^-i^ (or 
x i(i) ~^ x i)- Then the products of matrices are mapped like 

A XlXa{1) . . .A XrXa{T) -t A Xy(1)X ^ lf)(1)) . . ■ A x , i{T)X , i{p(r)) 
— A A 

In terms of diagrams this corresponds to permuting the "tiles" (crossings or 
straight pieces) A XiXa(i) according to the permutation 7. Some examples are 
shown in Appendix [X] In particular, this mapping does not change the number 
of any kind of tile Aoo, A01, A10 or An- 

Now, consider the second way of writing Q(ct) in ([3D]) and split the sums 
over subdiagrams T>(ai) according to V(ai) = V a {ai) U Vq 1 (a^) 

G(«)= E E E Tr(^...^). 

«i6Z>o(ai) 8 s £V (a s ) {t i= ±l} 
i—l...s 

We consider a subsum 

E Tr(^...^) 

{t»=±l} 

i— 1...S 

for fixed subdiagrams Si € 2?o(<3i)- In the following we will prove that the 
contribution of this subsum to Q(a) is larger or equal to the contribution of 
the subsum of the corresponding diagrams to Q(j3). From this it immediately 
follows that Q(a) > Q(j3). 

It follows a general proof of the inequality between corresponding subsums. 
For illustration one subsum is evaluated in full detail with diagrams in Ap- 
pendix [X] 
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For the non-overlapping permutation a in the subdiagrams Si all the straight 
lines and dashed lines are aligned, i.e. we have a weak and a strong rail. Let in; 
be the number of crossings and rij the number of straight pieces in S$. The strong 
rail in subdiagram Si contributes a factor X mi [i ni and the weak rail contributes 
the factor K mi v Ui . Summing over reflections the subsum equals 



2 I I A fl + K V 



i=l 

On the other hand, for the possibly overlapping /3 some of the tiles are permuted 
and for one particular subdiagram, not all the strong rail pieces might be on the 
same rail. Let rhi be the number of crossings that are thus misaligned and fii 
the number of straight pieces that are misaligned. The two rails in subdiagram 
Si now contribute the factors A" 1 * ""^ k™* ii ni ~ fi * v n * and A^/s 771 *-**^*^ 71 ' - "*. 
Summing over reflections the subsum adding to Q(/3) equals 



mi—mi.„m.i.,ni—ni„,ni , \nii ,„m; — mi , ,rii , ,ni—n 
K fX V + A K [I V 



Because A > Kand/i > v it follows that \ m * n n *+K m *v n ' > ^-h^h^ 
yni K mi-m,i ^rii yni-rn ^ ne S (; ron g anc j we ak rail dominate the two mixed rails), 
and hence we have the desired inequality between corresponding contributions 
to Q(a) and Q(J3). □ 

Proposition 13. For A = A\ the depolarizing channel in two dimensions, 
Q(a) is either maximal when a = (1 . . . r) or when a = id. 

Proof. First, consider permutations that consist of non-overlapping a, Lemma fTTI 
proves that either a = id or a = (1 ... r) yields the maximum Q(a) amongst 
these permutations. Every conjugacy class has a non-overlapping represen- 
tant, and Lemma [12] states that these have maximal Q- value. Therefore either 
a = id or a = (1 ... r) yield the maximal Q- value amongst all the permutations 
a G Symm(r). □ 

3.2 Bound on Lipschitz constant 

We derive an upper bound on the Lipschitz constant of the function / : S 2 + _1 — > 

K 

/(!<#) =Tr(Ar(|0W) 

with respect to the Euclidean norm on 

We divide the function into four steps and prove bounds on the Lipschitz 
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constants for each step. The splitting isf — docoboa with 

a:\4>)^\4>)^\=p 
b:p^Af n (p)=p> 
c : p' — > eigenvalues of p' = v 

d:v^^v r % . 

i 

Let M m be the space of complex m x m matrices containing the set of states 
S m = {peM m ■ p = p*,Trp = 1}. 

Lemma 14. Le£ a(|<^)) = \(f>}(4>\ a map C m -> A4 m w/iere = 1. T/ien f/ie 

Lipschitz constant of a with respect to the Euclidean norm in the domain and 
the Frobenius norm \\M\\ 2 = Tr (MM*) 1/2 in the range is upper bounded by y/2. 

Proof. For (<j>\<j>) = (ip\ip) = 1 set c = (<j>\ip). Now 

\m - mi = m) - - (v#> + «#> = 2 - 2»( C ) , 

and 



I2 = (</W - 2 I WW I + (V#> = 2 - 2\c\ 2 . 
Because of 3?(c) < |c| and the inequality derived as follows 

(l-|c|) 2 >0 
2-2|c| > 1- |c| 2 
2(2 — 2|c|) > 2-2|c| 2 , 

we arrive at 2(2-2K(c)) > 2-2|c| 2 or >/2\\\<f>) - \ip)\\ 2 > \\\(())(^\-\tp)(^\h. □ 

Lemma 15. Let b(p) = Af n (p) a map <S 2 ™ — > <?2™, then the Lipschitz constant 
of b with respect to the Frobenius norm in domain and range is upper bounded 
by K n , where n = A + so < k < 1 for < A < 1. 

Proof. It is useful to use the notation 

i\J\ 



Jcz„ ^ 2 ' 



where Tr j is the partial trace over the systems with indices in J. Notice that 
we use a loose notation of the tensor product as the systems that are partially 
traced out and replaced by the totally mixed states are not necessarily all on 
the right side of the tensor product. 

We will bound the operator norm ||-4|| op := sup p£S ^^J 2 where the supre- 
mum is over the set S — {\<fi)((f>\ — \ip)(?p\ \ \<t>), \^) & <S2™-i}- Because of 

\\A(p)-A(t)\\ 2 = \\A(p-t)\\ 2 <\\A\\ op \\p-t\\ 2 
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for a linear map A bounding ||-4|| op immediately gives a bound of the Lipschitz 
constant as well. Now 



|AflU 



< ^A"" |J| (1- A) 



Tr 



< 



E^ 



l-A 



&\j\ 



E 



k=0 



i n—k 



1- A 

~7T 



A + 



where we used the bound 



Tr { i... fc} 



l-A 

7T 



op 



sup 

pes 



Tr (Tr {1 ... fe} p® (£ 



= Tr 



Tr p 2 



sup 

pes 



Tr (Tr {1 ,„ fc} p) 
Tr p 2 



(31) 



The supremum in pip was evaluated as follows. First, consider that p = 
\4>)((j>\ — \il)){4>\ can be written as p = a|0)(0| — a|l)(l| with < a < 1, where 
|0) and |1) are orthonormal states. Then the supremum runs over all possible 
orientations of |0) and |1) 



sup 

pes 



Tr (Tr {1 ... fc} p) 
Tr p 2 



sup 

|o),|i; 



sup 



Tr (Tr {1 ... fc} (q|0)(0|-o|l)(l|))' 
TrH0)(0|-«|l)(l|) 2 

Tr(Tr {1 ... fc} (|0)(0|-|l)(l|)) 2 



10,11) Xr(|0)<0| - |1)(1|)= 



sup 

|o> ,| i; 



Tr(Tr {1 ... fe} (|0)(0|-|l)(l|)) S 



Now assume po — Tr {]....&} |0)(0| and p\ — Tr {!...&} 1 1) {1 1 are arbitrary density 
matrices. Then 



sup 

|o> ,| i; 



Tr(Tr {1 ... fe} (|0)(0|-|l)(l|)) : 



sup 

Po,Pi 



Tr(p - PiY 



Tr (p 2 +p 2 - 2p aPl ) 
sup = 1 . 

P0,Pl z 
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The last equality follows from the fact, that Trpgi < 1 an d TrpoPi > 0. The 
suprema are achieved when \cj)} = |00) and = |11) so that po = |0)(0| and 
pi = |l)(l|. □ 

Remark 16. Let c : S m — > K m be the map that sends density matrices to their 
eigenvalues, ordered high to low. The fact that the Lipschitz constant of c is 
upper bounded by 1 is equivalent to the Hoffman- Wiclandt inequality |19j 

||p-t|| 2 >||c(p)-c(t)|| 2 . 



Lemma 17. Let d(v) — J2i v l a ma P f rom G J2i w i = 1} ^° ^- Then 
the Lipschitz constant of d is upper bounded by r. 

Proof. We have Jjj- = rv r ~ x so 

sup IWI^sup^r 2 ^- 1 ) <r 2 . 

t?G Domd 

By integration we get 

\d(v) -d(w)\ <r\\v-w\\ 2 . 

□ 

Proposition 18. The Lipschitz constant!] of Tr (A®"(|0) (</>|) r ) , with respect 
to the Euclidean norm in the domain, is upper bounded by ^/2rn n , with n as in 
Lemma \15[ 

Proof. From Tr (A® ((j)\) r ) = d o c o b o a(\(f>)) with a, b, c and d as defined 
in Lemmas /Remark 11441 L Tl Thus, we simply combine the upper bounds of the 
lemmas and get the bound \J~2 • K n ■ 1 • r. □ 



4 Other results 

4.1 • • -t)) for a more general qubit channel 

For a channel A that maps that scales the Bloch sphere like 

Ai o o \ 

n -)• A 2 On, 

\o o A 3 y 

the Choi-Jamiolkowski representation is 



Choi(A x ) = 



A 








X \ 





// 


K 








K 


V 













/</ 
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K = Al 2 A2 ■ Therefore we have 



- !+ A 3 „ _ 1-A 3 
2 ' " 2 



i.e. this is a two-rail channel with /i 

have 

Q(id) = Tr„4(l)' r = Tr 1 = 2 



A = 



_ A!+A 2 



and 



and according to (|21|) 

Q((l . . . r)) = (/x + A)'' + (/x - A) r + {v + n) r + n) r 
' 1 + Ai + A 2 + A 3 \ r , / 1 - Ai - A 2 + A 3 



1 + Ai - A a - A E 



1 — Al + A2 — A; 



Assuming Q max = max{2, Q((l . . •»"))} the output is maximally mixed if |Ai| + 
|Aa| + I A3 1 < 1. But this is exactly the condition for A to be entanglement 
breaking, according to Theorem 3 in [6]. 

4.2 <2((1 •••»")) °f for any dimension d 

For the d-dimensional depolarizing channel we have Q(id) = Tr 1 = d as usual. 
Furthermore, the Choi-Jamiolkowsi representation consists of two blocks, one 
d-dimensional block with diagonal entries fx = and off-diagonal entries A, 
and another block that is v = times identity on the other d 2 — d dimensions. 
For example for d = 3 the representation looks like 

Ax A 



Choi(Ax) = 



The eigenvalues of the first block are /j, — A with (d — l)-multiplicity and a single 
eigenvalue fi + (d — 1)A. Therefore, 

Q((l . . . r)) = Tr Choi(A\) r 

= (n +(d- l)A) r + (d - l)(/i - A) r + (d 2 - d)v r 
l + (d»-l)AV + (<f _ 1 /l-A 



This result agrees with the result for = 2 found in 12.6.11 The critical value is 
A = gqHj, below this value the average output is maximally mixed. 
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5 Conclusion 



We found a general explicit form for /3 r (A) depending on the function Q_4 : 
Sym(r) — > M. In the limit n — ¥ oo the maximal term Q ma x is dominant in 
(3l° E {A) and therefore the only relevant term. However, finding Q max is not 
easy in general. In the case of the qubit depolarizing channel we proved that 
Qmax = max{2, Qa a ((1 ■ • - r))} and also that /3 r (A A ) = S r (A\) for some A. 
For all r € N and A < 1/3 the regularized output entropy becomes 1. Because 
the typical high-dimensional random state is highly entangled our result for A < 
1/3 agrees with the notion that the tensor product of an entanglement breaking 
channel "chops up" these highly entangled states and produces maximally mixed 
states with almost certain probability. 

For future work it would be interesting to also study channels other than 
the depolarizing channel, especially channels that are not known to be additive. 
Also, it might be of interest to study quantities similar to S r S with the same 
general procedure, for example E[Tr.4(£> l(\<j>) (4>\) r ]- It would be insightful to 
gain a better understanding of the typical output and input states by finding 
explicit examples that conform with the average output. 
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A Q sum diagrams 

We calculate an example of the subsums appearing in the proof of Lemma [T3J 
Let a = (123) (45), f3 = (143) (25) and 7 = (24). With this choice the correspon- 
dence of diagrams consists of switching tiles 2 and 4. The choice of indices 



X\ 


£2 


X3 


X4 


X5 





1 


1 









gives the diagrams and totals shown in Table [5] The contribution to Q(a) 
dominates in both factors as expected because a is non-overlapping. 
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